A GMM supervector approach for spoken Indian language identification for mismatch utterance length

Abstract

Gaussian mixture model-universal background model (GMM-UBM) supervectors are used to identify spoken Indian languages. The supervectors are calculated from short-time MFCCs and their first and second derivatives. The UBM builds a generalized language model, and mean adaptation transforms it into a duration-normalized, language-specific GMM. Multi-class support vector machine (SVM) and artificial neural network (ANN) classifiers label the supervectors. Experimental evaluations were performed using 30-second speech utterances from nine Indian languages, comprising five Indo-Aryan and four Dravidian languages, extracted from the All India Radio broadcast news data set. Eight smaller data sets were manually derived to study the effect of mismatch between training and test utterance lengths. In mismatch conditions, identification accuracy decreases as the training utterance duration decreases. Investigations showed that a 32-mixture model with the ANN classifier performs best.
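As a rough illustration of the mean-adaptation step mentioned in the abstract, the Python sketch below builds a supervector by adapting the UBM means to one utterance and stacking them. It is a minimal sketch under assumptions that the abstract does not state: diagonal covariances, classic relevance-MAP adaptation of the means only, a hypothetical relevance factor of 16, and NumPy arrays for all inputs. The function name `map_adapt_supervector` is made up for this example, and the paper's duration normalization is not reproduced.

```python
import numpy as np


def map_adapt_supervector(frames, weights, means, variances, relevance=16.0):
    """Adapt UBM means to one utterance and stack them into a supervector.

    frames:    (T, D) feature vectors (e.g. MFCCs with derivatives)
    weights:   (C,)   UBM mixture weights
    means:     (C, D) UBM component means
    variances: (C, D) UBM diagonal covariances (assumption: diagonal)
    relevance: assumed relevance factor, not taken from the paper
    """
    # Log-density of every frame under every diagonal Gaussian: shape (T, C).
    diff = frames[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )

    # Posterior probability of each component for each frame.
    log_post = np.log(weights) + log_gauss
    log_post -= np.logaddexp.reduce(log_post, axis=1, keepdims=True)
    post = np.exp(log_post)

    # Zeroth- and first-order sufficient statistics.
    n = post.sum(axis=0)                      # (C,)
    first = post.T @ frames                   # (C, D)
    data_mean = first / np.maximum(n, 1e-10)[:, None]

    # Interpolate between the data mean and the UBM mean per component.
    alpha = (n / (n + relevance))[:, None]
    adapted = alpha * data_mean + (1.0 - alpha) * means

    # Concatenate the adapted means into one fixed-length vector.
    return adapted.reshape(-1)


# Toy usage with random data standing in for a 4-mixture UBM and 3-D features.
rng = np.random.default_rng(0)
ubm_weights = np.full(4, 0.25)
ubm_means = rng.normal(size=(4, 3))
ubm_variances = np.ones((4, 3))
utterance = rng.normal(size=(200, 3))
supervector = map_adapt_supervector(utterance, ubm_weights, ubm_means, ubm_variances)
```

The supervector length is the number of mixtures times the feature dimension, so utterances of any duration map to vectors of the same size, which is what lets a fixed-input classifier such as an SVM or ANN label them. Shorter utterances give smaller component counts, so the adapted means stay closer to the UBM means.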

Similar resources

Phonotactic Model for Spoken Language Identification in Indian Language Perspective

Indian languages are either Indo-Aryan, influenced by Sanskrit, or Dravidian, influenced by Tamil. Dravidian languages also show the influence of Sanskrit. All Indian languages show the influence of the Pali language, whose graphemes are influenced by Brahmi. All the Indian languages are phonetic in nature. Every Indian language has its distinctive phone set. North Indian languages are...

A New Approach to Credibility Premium for Zero-Inflated Poisson Models for Panel Data

The main aim of this research is to obtain and compare credibility premiums in count models with unreported claims for longitudinal data. Predictive premiums are calculated under squared-error and exponential loss functions and compared with each other. The desire to earn bonuses and rewards is one of the important reasons for not reporting accidents, and to benefit from discounts people often refrain from reporting low-cost accidents. In this research ...

GMM Supervector for Content Based Music Similarity

Timbral modeling is fundamental in content based music similarity systems. It is usually achieved by modeling the short term features by a Gaussian Model (GM) or Gaussian Mixture Models (GMM). In this article we propose to achieve this goal by using the GMM-supervector approach. This method makes it possible to represent complex statistical models by a Euclidean vector. Experiments performed for the musi...

MUESLI: multiple utterance error correction for a spoken language interface

We propose a method for using all available information to help correct recognition errors in tasks that use constrained grammars of the kind used in the domain of Command and Control (CC) systems. In current spoken language CC systems, if there is a recognition error, the user repeats the same phrase multiple times until a correct recognition is achieved. This interaction can be frustrating fo...

New Features for Language Identification Using GMM

Automatic Language Identification (LID) is the process of identifying the language spoken within an utterance. The challenge that this task presents is that no prior information is available indicating the content of the utterance or the identity of the language. Most of the existing LID systems are based on MFCC feature vectors. This paper introduces the use of new feature extraction approach ...

Journal

Journal title: Bulletin of Electrical Engineering and Informatics

Year: 2021

ISSN: 2302-9285

DOI: https://doi.org/10.11591/eei.v10i2.2861